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Abstract 

Organochlorine insecticide hexachlorocyclohexane (HCH) has recently been classified as a 'Persistent Organic pollutant' by 
the Stockholm Convention. The LinB haloalkane dehalogenase is a key upstream enzyme in the recently evolved Lin 
pathway for the catabolism of HCH in bacteria. Here we report a sequence-structure-function analysis of ten naturally 
occurring and thirteen synthetic mutants of LinB. One of the synthetic mutants was found to have ~80 fold more activity 
for (3- and 8-hexachlorocyclohexane. Based on detailed biophysical calculations, molecular dynamics and ensemble docking 
calculations, we propose that the latter variant is more active because of alterations to the shape of its active site and 
increased conformational plasticity. 
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Introduction 

Isomers of hexachlorocyclohexane (HCH) have been widely 
used as insecticides in various agricultural systems around the 
world for most of the last 70 years [1,2]. Despite mounting 
concerns about their human and eco-toxicity, they still find heavy 
use in some developing countries in particular [2,3]. Only the y 
isomer is insecticidal but either a mixture of the four major isomers 
(a, (3, y and 5), known as technical HCH, or purified y-HCH, 
known as lindane, have been used commercially. The purification 
of lindane from technical HCH has led to massive dumps (> 
50,000 tonnes) of the other isomers in several countries and the 
stability of all isomers (but particularly of (3-HCH) has led to 
widespread contamination of the environment, originating both 
from the dump sites and broad scale agricultural uses [1^] . 

Over 60 bacterial strains which can degrade HCH, about half 
of them Sphingomonads, have now been reported, and all those 
characterized biochemically and genetically have proven to use the 
well established Lin pathway (encoded by various lin genes) to 
degrade HCH [2,5,6] . There are two major forms of the pathway 
which differ in their initial reactions but subsequently converge. 
One form operates on ot-, y- and 8-HCH and is initiated by two 



rounds of dehydrochlorination, followed by two rounds of 
hydrolytic dechlorination. The other form of the pathway operates 
on P- and 8-HCH and is initiated by two rounds of hydrolytic 
dechlorination, albeit some of the subsequent steps remain to be 
elucidated. The linA/UmA gene/enzyme system catalyzes dehy- 
drochlorinations in both forms of the pathway via an E2 reaction 
mechanism, and the Zinfi/LinB gene/enzyme system catalyzes the 
hydrolytic dechlorinations via Sn2 displacement reactions [2,5,6]. 

The LinB enzyme is a monomeric 32 kDa protein from the a/ (3 
hydrolase fold superfamily [7]. Its structure has been solved [8] 
and its catalytic mechanism is reasonably well understood, with an 
Asp-His-Glu catalytic triad mediating nucleophilic attack on the 
substrate and then on an acyl-enzyme intermediate, and an 
oxyanion hole stabilizing the intermediate [9-12]. The best 
characterized LinB, LinB UX26A from Sphingobium japonicum 
strain UT26, has a broad substrate specificity, mainly due to a 
large active site volume, which includes monochloroalkanes (C3- 
C10), dichloroalkanes, bromoalkanes and chlorinated aliphatic 
alcohols [7,10]. Notably, this variant yields a significantly lower 
specificity constant for fi-HCH (0.02 mM -1 s -1 ) as compared to 
another relatively well characterized LinB, namely, LuiBb9oa 
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(identical to LinB MI1 205, and LinB BHC -A and LinB pL Bi) from 
Sphingobium indicum strain B90A (0.20 mM -1 s -1 ) [13]. None- 
theless the activity of LinB B90A for P-HCH is much lower than that 
of LinBux26A for some of the other halogenated aliphatic 
compounds mentioned above (see [7,13] for a comprehensive list 
of kinetic constants). 

A total of ten naturally occurring LinB variants have now been 
described which differ by as many as 16 (5.4%) of their amino acid 
residues [2] . At least some of these variants differ qualitatively in 
their substrate specificities; in addition to the (5- and 8-HCH 
difference above, LinB B90A will hydrolytically dechlorinate the 
metabolite tetrachlorocyclohenol (TCDL), whereas LuiBtjt26 does 
not [14]. A molecular dynamics simulation study suggests that this 
is mainly due to a difference in the flexibility of the entrance of the 
substrate access tunnel mediated by six out of the seven amino acid 
differences between the two enzyme variants [13]. 

Given the low but quantitatively different activities of the best 
characterized LinB UX26 and LinB B90A variants for (5-and 8-HCH, 
and their quantitative differences in respect to TCDL, our aim in 
the work described herein has been to quantitatively assess the 
variation in the (3- and 8-HCH activities of all ten known naturally 
occurring LinB variants, plus another 1 3 synthetic variants derived 
on the basis of these data and the known structure of the LinB UX26 
protein. We find one synthetic variant with nearly 80-fold higher 
activity than LinB B9 () A for P-HCH and we suggest an explanation 
for its performance on the basis of increased mobility of its cap 
domain and increased affinity for the substrate. 

Materials and Methods 

Gene synthesis, expression vectors and chaperones 

Codon optimized UnB genes for expression in E. coli were 
synthesized by Geneart AG, Regensburg Germany (Table SI). 
These genes were PCR amplified with respective attBl, attB2 and 
attB2-R2 primers (Table S2) and the amplicons were then cloned 
into pDONR201 and transferred to pDEST17, following the 
manufacturer's instructions (Invitrogen, CA). The host E. coli 
BL21-AI (Invitrogen) cells for some clones co-expressed chaper- 
ones from the plasmid pGro7 (Takara, Japan). 

Gene expression, enzyme purification and enzyme assays 

Gene expression, enzyme purification and enzyme assays were 
performed as described earlier [15]. Briefly, the expression clones 
were grown in LB at 28°C until the OD 600 has reached 0.5. At this 
point L-(+) arabinose was added at a final concentration of 2 g/L. 
Cultures were grown overnight, cells were harvested by centrifu- 
gation and cell free extract was prepared in 10 mM imidazole 
buffer containing IX Bugbuster solution (Novagen, Darmstadt). 
The cell free extract was centrifuged at 16,000 g for 20 min at 4°C 
and the supernatant was subject to the Ni 2+ -affmity chromatog- 
raphy to purify 6xHis-tagged enzyme using standard procedures. 
The purified enzymes were quantified using Nanodrop (Thermo 
Scientific, DE) and stored in storage buffer (pH 7.5) containing 
10% glycerol and 1 mM 2-mercaptoefhanol at 4°C. Enzyme 
assays were performed within 3 days of purification in a 500 ul 
reaction mixture (final volume) containing 1.7 U.M of HCH in Tris 
glycine buffer (25 mM Tris, 193 mM Glycine, pH 8.3) at 22°C. 
The reaction was initiated by adding enzyme and terminated by 
adding 0.3% formic acid (final concentration). HCH depletion in 
the reaction was monitored by an electron capture detector 
coupled to a gas chromatograph as described earlier [15]. The 
enzyme assays were conducted in triplicate. 



Computational analysis 

Atomistic models of LinB B90A as well as our most active mutant, 
LinE>G2.2) were constructed by performing in silico mutagenesis on 
the LinB UT26 structure (pdb 2BFN, [12]) with the Rosetta software 
suite. The appropriate residues were mutated and the surrounding 
non-catalytic residues were allowed to repack in accordance with 
the Rosetta fixed backbone protein design protocol [16]. During 
these calculations, the catalytic residues were kept in their 
crystallographic conformations while 500 cycles of repacking and 
optimization were performed and the lowest energy structure was 
retained. To construct models with P-HCH bound, the Rosetta 
enzyme design protocol was used to dock the transition state 
structure into the active site [17]. Restraints were added to keep P- 
HCH as close to the transition state as possible (the transition state 
was obtained from Brittain el al. based on their previous 
characterization of this reaction using electronic structure calcu- 
lations [18]). Again, 500 cycles of repacking and optimization were 
performed and the lowest energy result was used. The reactant 
structure of P-HCH was fitted onto the transition state complex to 
yield a final conformation that was geometrically well posed for 
catalysis and still well described by a classical molecular mechanics 
energy function (Figure SI). 

In order to assess the approximate affinity of LinB B90A and 
LinB G 2.2 f° r P-HCH, the Rosetta Ligand Dock program [19] was 
used to dock P-HCH into the apo structures of each enzyme. For 
each enzyme, 500 docking trajectories were computed. The results 
were first ranked by their total score (which approximates the 
overall stability of the model). The top 5% of these docked 
complexes were then ranked by their binding energy for P-HCH. 
The average binding energy was computed from the 10 best 
models. 

Atomistic molecular dynamics simulations were performed on 
the apo and P-HCH-bound states of LinB B9()A and LinBo2.2- 
These simulations were performed using the GROMACS 4.5.3 
simulation package [20] (default parameters are used unless 
otherwise specified) with the AMBER99sb force field [21], the 
generalized Amber force field for P-HCH parameters [22], and 
the TIP3P explicit water model [23]. Temperature was kept at 
298°K using the velocity-rescaling thermostat [24]. A 2-femtosec- 
ond timestep was used for all dynamics simulations with bond 
lengths constrained using the SHAKE algorithm [25]. Electro- 
static interactions were calculated using the particle-mesh-Ewald 
method [26] with a cutoff of 15 A. After minimization, simulations 
were equilibrated by running for 0.5 ns at constant volume and 
temperature with the positions of all non-solvent atoms harmon- 
ically restrained. Following this equilibration, the restraints were 
removed and a 30 ns production run was performed. Results were 
analysed using the VMD software package [27] and 3D structures 
were rendered using PyMOL (The PyMOL Molecular Graphics 
System, Version 1.3, Schrodinger, LLC). 

Results and Discussion 

Eight of the ten natural LinB variants showed clearly 
measurable activities with both P- and 8-HCH (Table 1). As 
expected from previous reports, LinB UT2 6 yielded lower activities 
with either isomer [7,13,14], as did the closely related LinBsp+. 
The other eight variants all showed 10-20 fold greater activities 
with P- than 8-HCH. 

There was about a three-fold range of activities for each isomer 
across the latter eight variants. However, there was no obvious 
difference in activities across the three sequence-based groups of 
variants recognized by Lai et al [2] (again excluding LinB UT2 6 and 
LinB SP+ ). Instead, there was considerable variation within their 
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two major groups (1 and 3 in Table 1), including some large 
differences between otherwise very closely related variants. The 
sequence differences between these variants frequendy involved 
residues (eg 112, 134, 135, 138 and 253) constituting the enzyme's 
active site. 

Based on the above results, six generation one synthetic variants 
(denoted G1.1-G1.6) on the LinB B 90A background and one 
synthetic variant each on the LinB SS04 _ :i (G1.7) and LmB NM05 
(G1.8) background were made. One of the former (G1.3; T135L) 
was generated by mistake (physio-chemically similar T135A was 
intended) during gene synthesis. These eight generation one 
mutants were analyzed for R- and S-HCH activities in order to 
determine the contributions of individual amino acid differences to 
the activity variation. Three LinB Bg0 A variants (Gl.l— G1.3) 
showed ~2 fold higher activity (Table 1), and these mutations 
(A83P, VI 34L and Tl 35L), along with certain others, were used to 
make five second generation double and triple mutants on the 
LinB B9 oA background. These mutants were also tested on R- and 
8-HCH. 

Overall eleven of the synthetic variants from both generations 
one and two yielded activity values within the range seen for the 
natural variants (0-214 and 0-8 min 1 for fi- and 8-HCH 
respectively; Table 1). However, LinB G 2.3 and LinB G2 .2 yielded 
activities 2-4 and 30 fold, respectively, higher than any previously 
seen, and 4-8 and about 80 fold higher than those of the LinB B90A 
reference variant (Table 1). R-HCH activities were again much 
higher than 5-HCH for all the synthetic variants and there was a 
strong correlation between R- and 8-HCH activities across the 
total of 23 natural and synthetic variants (r 2 = 0.99; P<0.03). 

Table 2 rearranges a subset of the data from Table 1 to show 
the effects of single amino acid differences at eight positions spread 
across the enzyme (T81A, A83P, VI 12A, V134L, T135L, LI 381, 
H247S and I253M). In six of these cases the effects of each 
substitution could be examined in more than one haplotype 
background. All eight substitutions had significant effects in at least 
one, and generally, most of the backgrounds tested. For six of the 
substitutions the effect was 1 0 fold or greater. Notably however, all 
six substitutions tested in more than one background also showed 
significant, and often also large, effects of the background 
haplotype. The table shows clear cases of positive (V134L/ 
T135L in LmB G2 . 2 ) and negative epistasis (A81 and P83 producing 
two-fold increases in R- and 8-HCH activities alone but a two-fold 
decrease together). Also a comparison of the A83P mutation in 
LinB B9 oA and LinB SS0 4-3 illustrates the effect of the genetic 
background; this mutation doubled activity in the LinB B90A 
background but decreased it by 10 fold in the LinB SS04 _ 3 
background. 

The fact that the use of chaperones was necessary for expression 
of a few variants with higher activity suggests that some of the 
mutations nevertheless were deleterious for protein solubility or 
stability. We are now exploring whether the chaperone depen- 
dency of these variants could be ameliorated by stabilizing 
mutations. 

Molecular dynamics simulations were used to probe the 
conformational dynamics of the synthetic LinB G2 2 variant relative 
to LinB B9 oA (our reference variant). These simulations revealed 
large differences in conformational flexibility within the cap 
domain (Figure 1). This domain occludes the active site and has 
previously been shown to be more flexible in LinB relative to the 
related dehalogenases DhaA and DhlA [28]. Here we see that 
LinB G2 .2 shows gready enhanced flexibility in this region when 
compared to LinBsgoA- Unlike some of the other variants of this 
enzyme, LinB B90 A has a histidine at position 247. This residue has 
been hypothesized to enhance activity by reducing solvent access 
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Figure 1. Comparative conformational flexibility within the cap domain of two LinB variants. Sausage diagram for LinB BgoA (A) and 
LinB G 2.2 (B) in which the thickness of the chain as well as the colour denotes the average RMSD of each residue. The active site catalytic residues are 
rendered in ball and stick. The RMSD of each residue as a function of time is plotted for LinB B90A (C) and LinB G2 .2 (D). 
doi:10.1371/journal.pone.0103632.g001 



to the active site [13]. Although this mechanism may be feasible 
for some of the smaller haloalkane dehalogenase substrates (such 
as dichloroethane and trichloropropane), HCH cannot easily 
diffuse into the partially occluded active site in LinB, nor can it 
easily enter through the side tunnels previously characterized for 
this enzyme [13]. As such, it seems reasonable to assume that 
increased flexibility in the cap domain plays an important role in 



the enhancement of catalysis as observed in LuiBg2.2 relative to 
LinB B90A by removing kinetic barriers to substrate binding (in spite 
of the fact that this potentially allows greater solvent penetration 
into the active site). 

Additionally, our simulations indicate that LinBG2.2 is able to 
bind (3-HCH more tightly than LinBggoA- The average binding 
energy of (3-HCH to LinB G2 2 was calculated to be — 4.25±0.13 
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Figure 2. Approximate affinity of LinB B90 A and LinB G22 for p-HCH using Rosetta Ligand Dock program. Highest scoring docked pose for 
LinB B90A (A) and LinB G2 .2 (B). The ligand (3-HCH is shown as Van der Waals spheres, catalytic residues (as well as residue 134 which differs between 
LinB B9 oA and LinB G 2.2) are rendered as thick sticks. Other residues within 5 A of (3-HCH are rendered as thin sticks. The empty space in the binding 
pocket of LinB B 90A that is filled by the V134L mutation of LinB G 2.2 is outlined with red dashes. 
doi:10.1371/journal.pone.0103632.g002 



REU (Rosetta energy units) compared to that of LinB B90 A, which 
was — 2.54±0.22. This modest increase in affinity arises in part 
from more favourable hydrophobic interactions between LI 34 
and (3-HCH (Figure 2). This is also supported by results from 
molecular dynamics simulations with |3-HCH bound, which show 
that LinB G 2.2 is a bk to keep the ligand much closer to the catalytic 
conformation than LinB B90A (Figure 3). Both the distance between 



the nucleophilic oxygen and the ligand centre of mass as well as 
the RMSD from the catalytic conformation are much lower for 
LinB G 2.2 relative to LinB B9 () A (Figure 3). 

Collectively, these results imply that the V134L/T135L 
mutations of LinBG2.2 enhance catalysis by affecting the confor- 
mational plasticity of the cap domain and by holding the substrate 
in a suitable conformation for catalysis. More extensive simula- 
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Figure 3. Molecular dynamics simulations of LinB with p-HCH bound in the active site. The distance between the nucleophilic oxygen of 
Asp108 and the center of mass of p-HCH is shown as a function of time (LinB B 90A in blue and LinB G2 .2 in red) in panel A. Panel B shows the RMSD from 
the near-attack conformation for (3-HCH as a function of time (LinB B90 A in blue and LinB G2 .2 in red) in panel B. 
doi:10.1371/journal.pone.0103632.g003 
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tions as well as additional biophysical and biochemical analysis are 
now needed to provide a more complete picture of the relevant 
conformational motions of this enzyme. 
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